High-Throughput Sequencing and De Novo Assembly of the Isatis indigotica Transcriptome
Abstract
BACKGROUND: Isatis indigotica, the source of the traditional Chinese medicine Radix isatidis (Ban-Lan-Gen), is an economically important crop in China. To facilitate biological, biochemical and molecular research on the medicinal compounds in I. indigotica, we report the first I. indigotica transcriptome generated by RNA sequencing (RNA-seq). RESULTS: An RNA-seq library was constructed from RNA extracted from a mixed sample of leaf and root tissue. A total of 33,238 unigenes were assembled from more than 28 million high-quality short reads. The quality of the assembly was examined experimentally by cDNA sequencing of seven randomly selected unigenes. Based on BLAST searches, 28,184 unigenes had a hit in at least one of the protein and nucleotide databases used in this study, and 8 unigenes were found to be associated with the biosynthesis of indole and its derivatives. According to Gene Ontology classification, 22,365 unigenes were categorized into 48 functional groups. Furthermore, Clusters of Orthologous Groups (COG) and Swiss-Prot annotations were assigned to 7,707 and 18,679 unigenes, respectively. Analysis of repeat motifs identified 6,400 simple sequence repeat (SSR) markers in 4,509 unigenes. CONCLUSION: Our data provide a comprehensive sequence resource for molecular studies of I. indigotica. Our results will facilitate studies on the functions of genes involved in the indole alkaloid biosynthesis pathway and on the metabolism of nitrogen and indole alkaloids in I. indigotica and related species.
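The abstract does not say which tool or thresholds were used to identify simple sequence repeats. As a rough illustration of what such a scan involves, the following minimal sketch finds perfect di- to hexanucleotide repeats in a sequence with a regular expression. The function name `find_ssrs` and the minimum repeat counts in `MIN_REPEATS` are assumptions made for this example, not the authors' settings.

```python
import re

# Hypothetical minimum repeat counts per motif length (not taken from the paper).
MIN_REPEATS = {2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs in seq."""
    seq = seq.upper()
    hits = []
    for k, n in sorted(min_repeats.items()):
        # A k-mer followed by at least n - 1 further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "AA" or "ATAT"),
            # so each repeat is reported once under its shortest motif.
            if any(motif == motif[:d] * (k // d) for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)


if __name__ == "__main__":
    example = "GGC" + "AT" * 7 + "CCG" + "GAA" * 5 + "TTC"
    for start, motif, count in find_ssrs(example):
        print(start, motif, count)
```

Applied to every unigene in an assembly, a scan of this kind would yield per-sequence SSR counts comparable in form to those reported above, although the exact numbers depend on the chosen thresholds and on whether compound or imperfect repeats are considered.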
Similar resources
Clustering of Short Read Sequences for de novo Transcriptome Assembly
Given the importance of transcriptome analysis in various biological studies and considering the vast amount of whole transcriptome sequencing data, it seems necessary to develop an algorithm to assemble transcriptome data. In this study we propose an algorithm for transcriptome assembly in the absence of a reference genome. First, the contiguous sequences are generated using de Bruijn graph with d...
Combined transcriptome and metabolite profiling reveals that IiPLR1 plays an important role in lariciresinol accumulation in Isatis indigotica.
A lignan, lariciresinol, is an important efficacious compound for the antiviral effect of Isatis indigotica, a widely used herb for the treatment of colds, fever, and influenza. Although some rate-limiting steps of the lariciresinol biosynthetic pathway are well known, the specific roles of gene family members in I. indigotica in regulating lariciresinol production are poorly understood. In the...
Transcriptomic Analysis Reveals Differential Gene Expressions for Cell Growth and Functional Secondary Metabolites in Induced Autotetraploid of Chinese Woad (Isatis indigotica Fort.)
The giant organs and enhanced concentrations of secondary metabolites produced by autopolyploidy are attractive for breeding medicinal and agricultural plants and for studying the underlying genetic mechanisms. The traditional medicinal plant Chinese woad (Isatis indigotica Fort., 2n = 2x = 14) is still widely used in China to treat diseases caused by bacteria and viruses. In this study, it...
ChimeraScan: a tool for identifying chimeric transcription in sequencing data
SUMMARY Next generation sequencing (NGS) technologies have enabled de novo gene fusion discovery that could reveal candidates with therapeutic significance in cancer. Here we present an open-source software package, ChimeraScan, for the discovery of chimeric transcription between two independent transcripts in high-throughput transcriptome sequencing data. AVAILABILITY http://chimerascan.goog...
htSeqTools: high-throughput sequencing quality control, processing and visualization in R
We provide a Bioconductor package with quality assessment, processing and visualization tools for high-throughput sequencing data, with emphasis on ChIP-seq and RNA-seq studies. It includes detection of outliers and biases, inefficient immuno-precipitation and overamplification artifacts, de novo identification of read-rich genomic regions and visualization of the location and covera...